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Conformational landscapes of DNA polymerase I 
and mutator derivatives establish fidelity 
checkpoints for nucleotide insertion 
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The fidelity of DNA polymerases depends on conformational changes that promote the 
rejection of incorrect nucleotides before phosphoryl transfer. Here, we combine single- 
molecule FRET with the use of DNA polymerase I and various fidelity mutants to highlight 
mechanisms by which active-site side chains influence the conformational transitions and 
free-energy landscape that underlie fidelity decisions in DNA synthesis. Ternary complexes of 
high fidelity derivatives with complementary dNTPs adopt mainly a fully closed conformation, 
whereas a conformation with a FRET value between those of open and closed is sparsely 
populated. This intermediate-FRET state, which we attribute to a partially closed conforma- 
tion, is also predominant in ternary complexes with incorrect nucleotides and, strikingly, in 
most ternary complexes of low-fidelity derivatives for both correct and incorrect nucleotides. 
The mutator phenotype of the low-fidelity derivatives correlates well with reduced affinity for 
complementary dNTPs and highlights the partially closed conformation as a primary 
checkpoint for nucleotide selection. 
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The remarkable fidelity of DNA synthesis is crucial for 
genome stability^ This fidelity depends on non-covalent 
transitions preceding the chemical step of phosphoryl 
transfer and serving as checkpoints whereby inappropriate 
substrates are rejected early during the reaction^. Among these 
transitions, the fingers- closing transition is of particular interest: 
the binding of the complementary dNTP to a polymerase- DNA 
(Pol-DNA) binary complex leads to a conformational change in 
which a part of the fingers subdomain (including the O-helix) 
moves from an open to a closed conformation (Fig. la-c), and 
forms a tight binding pocket around the nascent base pair^~^. 
Previously, we used single-molecule fluorescence resonance 
energy transfer (FRET) to examine fingers- closing in the 
Escherichia coli DNA polymerase I (Klenow fragment) (KF; 
see Santoso et al7). Our results identified the open and 
closed conformations that predominate in the binary Pol-DNA 
and ternary Pol-DNA-dNTP (complementary) complexes, 
respectively, and showed that the unliganded polymerase 
explores both open and closed conformations^" We also 
showed that ternary complexes with non- complementary 
dNTPs or complementary ribonucleotides form distinct FRET 
species (with FRET signals intermediate between the open and 
closed states). 

The fidelity and speed of DNA replication can be profoundly 
affected by substitution of amino acids near the polymerase 
active site^^"^"*, implying that specific side chains control 



the discrimination between correct and incorrect nucleotides. 
Two well-studied amino acids contributing to fidelity in KF are 
E710 and Y766 (Fig. la), both universally conserved within 
A-family polymerases^^. Residue E710 helps in positioning the 
incoming nucleotide by interacting with the dNTP base-sugar 
portion, and acts as a steric gate that blocks incorporation of 
ribonucleotides^^. Residue Y766 stacks on the template base of 
the terminal base pair and blocks the insertion site in the binary 
complex (Fig. lb). Upon fingers-closing, Y766 rotates away from 
the terminal base pair and is replaced by the templating base, 
which moves into the catalytic pocket (Fig. Ic); the hydrogen- 
bonded Y766-E710 pair creates part of the binding pocket and 
controls the nascent base-pair geometry. 

Here, we report a conformational analysis of wild-type KF (wt 
KF) and KF derivatives with single amino-acid substitutions at 
E710 and Y766, respectively, and obtain conformational energy 
landscapes for various complexes along the prechemistry path- 
way. Three of the substitutions examined (E710A, Y766A and 
E710Q) decrease fidelity, acting as 'mutators'^ we also 
analysed Y766F, a derivative with enzymatic properties similar 
to wt KF^l'^l 

Since mutators fail to transition efficiently from the partially 
closed to the closed conformation, our results indicate that the 
partially closed state is an important fidelity checkpoint and that 
residues E710 and Y766 are essential for the discrimination 
between complementary and non -complementary nucleotides. 
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Figure 1 | Structural view of DNA polymerase I and key residues in this study, (a) The open Pol-DNA binary complex (PDB accession code 1L3U) is 
illustrated using structural data from Bacillus stearothermophilus (Bst) DNA polymerase^, a close homologue of KF. In the following, the KF numbering 
scheme is used. The fingers subdomain is coloured blue with the exception of the 0- and 01-helices coloured beige, the thumb green, the palm subdomain 
red and the 3'-5' exonuclease grey. The DNA template strand is coloured brown and the DNA primer strand black. The amino acid Y766 is coloured 
magenta and E710 cyan. The positions of the residues used for labelling are shown as a red sphere for the acceptor (residue 550) and a green sphere for 
the donor (residue 744). See also Supplementary Methods and Supplementary Table SI for details regarding site-specific labelling, (b) Position of E710 and 
Y766 in the Pol-DNA binary complex (PDB accession code 1L3U). The O-helix is in the open conformation, Y766 is stacked on the template base 
of the terminal base pair and the next templating base is in a pocket between the 0- (mobile) and 01- (immobile) helices. The last two base pairs of the 
template-primer duplex are in grey, (c) Position of E710 and Y766 in the Pol-DNA-dNTP (correct) ternary complex (PDB accession code 1LV5 (ref. 4)). 
With the O-helix in the closed position, Y766 has moved away from the terminal base pair, creating space for the templating base. Y766 forms a hydrogen 
bond with E710, shown as dashed black line, (d) DNA oligonucleotides used in this study. A dideoxy nucleotide (3'-H) prevents extension of the 
3' terminus. The templating base is shown in red. 
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Results 

wt KF shows an on-pathway intermediate. The conformational 
profiles for wt KF alone and in complexes with non- extendable 
DNA substrates (Fig. Id) are shown in Fig. 2a. The unliganded 
polymerase exhibits a broad FRET distribution (Fig. 2a, row 1), 
slightly biased towards an open conformation (60% occupancy, 
£'^iowFRET~0-48). Formation of the KF-DNA binary complex 
(using DNAl, Fig. Id) further increases the open- state occupancy 
(81%, E'^open^ 0-45; Fig. 2a, row 2). The peak positions of the low- 
FRET species are slightly different in the unliganded polymerase 
and the binary complex; this is mainly because of unliganded 
polymerase dynamics^'^'^^, but may also be due to 
conformational states other than the open and closed states, as 
discussed below. 

Addition of the complementary nucleotide (A-dTTP) forms a 
ternary complex, wherein ~ 85% of all molecules display a closed 
conformation (£'^ciosed~0-66; Fig. 2a, row 3). Notably, the 
presence of 1 mM dTTP (instead of 10 jiM used previously^) 
increases significantly the E"^ of the low-FRET species 
(E'^iowFRET^O-^l) compared with the E'^open in the binary 
complex (see Supplementary Fig. SI). Indeed, the ^'^iowfret 

for 

1 mM dTTP is closer to the intermediate- FRET species 
(£'*^~0.48) also populated by complexes containing 1 mM of 
either non -complementary dNTP or complementary rNTP (A- 
dGTP and A-rUTP, respectively; Fig. 2a, rows 4 and 5). For both 
A-dGTP and A-rUTP ternary complexes, the occupancy ratio of 
the £"^^0.48 FRET species and the closed conformation is ~9:1. 

We measured nucleotide-binding affinities by analysing FRET 
histograms generated at different nucleotide concentrations 
(Fig. 3a). The equilibria between the FRET species are depicted 
in Fig. 3c: the initial binding of a nucleotide (N) to the open 



binary complex (O) giving the intermediate- FRET species (IN) is 
described by the association constant Ki (or its reciprocal Kji), 
whereas the conformational equilibrium between (IN) and the 
closed ternary complex (CN) is described by constant K2. In 
addition, the binary complex is in equilibrium between open (O) 
and closed (C) states, described by constant Kq. The nucleotide 
titrations provide two observables (Fig. 3a): the first is the peak 
shift of the low-FRET Gaussian (comprising the unresolvable O 
and IN species); the halfway point, corresponding to [O] = [IN], 
is at a nucleotide concentration equal to K^i. The second 
observable is the fraction of polymerase complexes in high-FRET 
conformations. We used global fitting of both observables to 
obtain Kq, Ki and K2 (Table 1). Extensions of our model are 
possible without altering its general conclusions; a five-state 
model including a potential intermediate -FRET species in the 
binary complex led only to a small additional scaling factor for 
K^i (see Supplementary Methods). 

With wt KF, the complementary dNTP (A-dTTP) forms the 
intermediate- FRET species with a K^i of 5.1 jiM (Table 1); the 
favourable equilibrium towards the closed complex (^2 = 6.1) 
augments the binding affinity so that the closed conformation is 
stabilized (dissociation constant of the closed ternary complex, 
Kd2 — 1/(^1^2) = 0-83 |iM). The affinity of incorrect nucleotides 
or complementary rNTPs for the intermediate state is weaker 
(Kdi: 46|iM for A-dGTP, 210 jiM for A-rUTP and 30|iM for 
A-rGTP; Table 1) and the low values of K2 (<0.2 in all cases) 
inhibit full fingers-closing. 

Titrations of wt KF with nucleotides in the absence of DNA 
(Supplementary Fig. S2e) show formation of an intermediate- 
FRET species, as reported previously^. The initial binding affinity 
(3.0 |iM for dGTP; 12|iM for rGTP) is similar to that for a 
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Figure 2 | Conformational profiles for wt KF and its derivatives. Histogranns of the FRET efficiency P for wt KF (a), E710A (b), E710Q (c), Y766A 
(d) and Y766F (e). Each histogrann (70 bins) was derived from several thousand fluorescence bursts detected fronn labelled polynnerases diffusing through 
a confocal spot. The histograms were fitted to a double-Gaussian model to determine the relative occupancies (shown as percentages) and mean 
peak positions of the individual FRET species (black lines, sum of Gaussians; grey lines, individual Gaussians). First row: unliganded polymerase (Pol). 
Second row: binary complex, formed by adding DNAl (with A as the templating base, final DNA concentration: 100 nM) to the polymerase (Pol + DNA). 
Third row: ternary complex with the correct nucleotide, formed by adding dTTP (1 mM) to the binary complex. Fourth row: ternary complexes with an 
incorrect nucleotide, formed by adding dGTP (1 mM) to the binary complex. Fifth row: ternary complex with the complementary ribonucleotide, formed 
by adding rUTP (1 mM) to the binary complex. The two dotted vertical lines mark the mean P values of the open and closed FRET species in the binary 
complex. Histograms for each enzyme were measured on the same day. The dashed line marks the mean P value of the intermediate-FRET species 
as populated in the ternary complexes (see also Supplementary Fig. SI). 
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Figure 3 | Determining equilibrium constants for wt KF and E710A. (a,b) Nucleotide dependence of FRETobservables associated with the formation of 
ternary complexes of wt KF (a) and E710A (b) in the presence of DNA1. The fraction of the molecules in the high-FRET species (black circles) and the 
shift of the mean P of the low-FRETspecies (blue triangles) were plotted as a function of nucleotide concentration. The shift was normalized relative to the 
P difference between the means of the open and closed conformations. Data points with error bars are represented as mean±s.e.m., derived from 
three independent measurements (see Supplementary Fig. S2 for examples of titrations). The data were globally fitted using a four-state equilibrium 
model (c). See Table 1 for results, (c) Four-state model describing the nucleotide-concentration-dependent FRETobservables (for details, see main text and 
Methods). Kq represents the equilibrium between open and closed forms of the binary complex (0: open, C: closed). Upon nucleotide (N) addition, 
the nucleotide-bound intermediate-FRET state (IN) and closed state (CN) are formed sequentially. The nucleotide-dependent equilibria are described 
by Ki and K2, respectively. The dissociation constants /Cdi and /Cd2 are calculated from Ki and K2. 



Table 1 | Equilibrium constants for nucleotide binding/ 



Complex 


Constants^ 


wt KF 


E710A 


E710Q 


Y766A 


Y766F 


A-dTTP* 


/Cdi (|iM) 
K2 

/Cd2 (|iM) 


5.1 ±0.6 
6.1 ±0.5 
0.83 ±0.12 


43 ±10 
0.25 ±0.01 
170 ±40 


120 ±27 
0.52 ±0.03 
230 ±50 


80 ±30 
1.0 ±0.1 
79 ±28 


18±4 
9.6 ±1.7 
1.9 ±0.6 


A-dGTP* 


/Cdi (^iM) 
K2 


46 ±5 
0.091 ±0.005 


16±2 
0.084 ±0.06 


33 ±4 
0.061 ±0.005 


26 ±4 
0.059 ±0.008 


66 ±7 
0.087 ±0.007 


A-rUTP* 


/Cdi (^M) 
K2 


210 ±24 
0.083 ±0.007 


92 ±8 
0.16 ±0.00 


126 ±7 
0.056 ±0.003 


98 ±10 
0.059 ±0.007 


280 ±36 
0.073 ±0.013 


A-rGTP 


/Cdi C^iM) 
K2 


30 ±7 
0.13 ± 0.01 










G-dATP 


/Cdi (|iM) 
K2 


230 ± 67 
0.55 ±0.04 






90 ±54 
0.076 ±0.030 




G-dCTP 


/Cdi (^M) 
K2 

/Cd2 (|iM) 


1.2 ±0.4 
6.1 ±1.2 
0.20 ±0.07 


70 ±11 
1.7 ±0.1 
42 ±7 




1.0 ±0.2 
8.9 ±1.2 
0.11 ±0.02 




G-dTTP 


/Cdi (^iM) 
K2 


230 ±76 
0.10 ±0.02 






49 ±30 
0.22 ±0.04 




T-dATP 


/Cdi C^iM) 
K2 

/Cd2 (|iM) 




82 ±10 
1.8 ±0.1 
45 ±6 




38 ±6 
0.90 ±0.04 
42 ±7 




T-dGTP 


/Cdi (^iM) 
K2 








30 ±3 
0.057 ±0.005 




dGTP 
(no DNA) 


/Cdi (|iM) 
K2 


3.0 ±0.7 
0.19 ±0.03 










rGTP 

(no DNA) 


/Cdi (^M) 
K2 


12±4 
0.080 ±0.057 











^Determined using nucleotide titrations (Figs 3 and 4, Supplementary Figs S2 and S3) as described in Methods (data represented as mean ±95% confidence interval representing the fitting error). 
tSee Fig. 3c for a description of the presented constants. 

ICalculated data for these base pairs were derived from three independent measurements. 
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Figure 4 | Conformational profiles depend on the tempiating base, (a-c) FRET histograms for binary and ternary complexes formed using wt KF (a), 
E710A (b) and Y766A (c). Conditions and style are as in Fig. 2, except that the DNA substrates were DNA2 (template G) or DNA3 (template T); for 
sequences, see Fig. Id. (d,e) Nucleotide-concentration dependence of the conformational distribution of the E710A ternary complex with G-dCTP 
and T-dATP base pairs (d) and of the Y766A ternary complex with G-dCTP, T-dATP and T-dGTP base pairs (e). Data analysis was as in Fig. 3; the results 
are shown in Table 1. In e, the error bars on the G-dCTP plot represent the s.e. from two independent titrations; all other titrations represent single 
measurements. 



correctly paired A-dTTP ternary complex; however, the 
subsequent transition to the closed complex is not favoured 
(K2<0.2, Table 1). 

The conformations of wt KF with complementary and 
mispaired dNTPs at a template G and a template T (Fig. 4a) 
were similar to those for template A (Fig. 2a). 



E710 has a critical role in the fingers-closing transition. 

The conformational profiles of unliganded E710A and its 
binary complex with DNAl resemble those of wt KF (Fig. 2b, 
rows 1 and 2). However, addition of the complementary 
nucleotide (A-dTTP; Fig. 2b, row 3), even at high concentrations 
( > 1 mM), led to only a modest increase in the occupancy of 
the closed state. Instead, the bulk of the population shifted from 
the open state (£"^^0.44) to an intermediate -FRET species 
(£^-0.48). The K^i for the E710A A-dTTP ternary complex was 
43 jiM (Table 1 and Fig. 3b), similar to that for wt KF with a 
mispair (A-dGTP), but several fold higher than that for wt KF 
with the complementary A-dTTP. Further, the K2 for the E710A 
A-dTTP ternary complex was < 1, similar to mispairs with wt 
KF (Table 1), preventing efficient formation of the closed con- 
formation. The interactions of E710A with both a non-com- 
plementary dNTP (A-dGTP) and a complementary rNTP (A- 
rUTP) showed the same shift towards the £'*^~0.48 state as with 
wt KF (Fig. 2b). With A as the tempiating base, E710A binds the 
incorrect dGTP 3 -fold more tightly (K^i = 16 |iM) than does wt 
KF (K^i = 46 jiM); interestingly, this incorrect interaction is also 
threefold tighter than the binding of the correct dNTP (A-dTTP) 
by E710A. For the complementary rNTP (A-rUTP), both K^i and 



K2 are ~ 2-fold more favourable for E710A than for wt KF, 
probably owing to easier accommodation of the ribose 2' -OH 
when the glutamate side chain is replaced by alanine. 

The behaviour of E710A at template G and template T (Fig. 4b) 
was as described for template A, except in the presence of 
complementary dNTPs. Despite weak initial binding of 
the complementary dNTPs, the more favourable K2 values 
of ~ 2 resulted in greater occupancy of the closed conformation 
(61% for G-dCTP and 58% for T-dATP at 1 mM dNTP); 
however, the overall affinity of E710A for all three complemen- 
tary base pairs is low (Fig. 4d, Table 1), indicating that E710A 
discriminates poorly between correct and mispaired dNTPs 
regardless of tempiating base. The conformational profile for 
E710Q at template A (Fig. 2d) was similar to that of E710A 
(Table 1). 



Y766A perturbs fingers-closing for complementary dNTPs. 

With template bases A and T, the Y766A mutator followed the 
pattern seen for E710A and E710Q (Figs 2d and 4c). Addition of 
complementary nucleotides (A-dTTP or T-dATP) led to equal 
occupancy of the intermediate- FRET and high-FRET states at 
high nucleotide concentrations (>lmM). Addition of incorrect 
nucleotides (A-dGTP, G-dATP, or T-dGTP) formed pre- 
dominantly the intermediate-FRET species. At template A and T, 
all dNTPs showed weak binding regardless of their com- 
plementarity to the tempiating base (Table 1). 

However, at template G, Y766A bound the complementary 
dCTP as strongly as wt KF (Kji^LOjiM for Y766A versus 
1.2 |iM for wt KF, Fig. 4e) and formed the closed complex almost 
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exclusively, reflecting a very favourable K2 (Table 1). Moreover, 
the single-turnover kinetic parameters for G-dCTP incorporation 
by Y766A and wt KF (Supplementary Table S2) are similar, in 
contrast to those for A-dTTP incorporation. Thus, at template G, 
Y766A distinguishes well between correct and incorrect dNTPs. 

The complexes of Y766F, which has similar fidelity to wt 
j7-pii,i4^ resembled those of wt KF (Fig. 2e, Table 1), the only 
difference being a ~ 3-fold decreased binding affinity (Kdi) for 
the complementary nucleotide (A-dTTP) and an increased 
K2 (9.6) so that the overall K^2 is ~ 2-fold weaker for Y766F. 



Y766 modulates the equilibrium in unliganded polymerases. 

Whereas the unliganded wt KF, E710A and E710Q polymerases 
exhibit similar conformational equilibria, Y766 substitutions shift 
the equilibrium (Fig. 2d,e). Loss of the bulky phenyl ring from the 
Y766 side chain increases the open- state occupancy to 78% 
(Y766A), whereas the more conservative Y766F substitution, 
which removes a potential hydrogen-bond donor, decreases the 
open-state occupancy to 28%. 

The large differences between wt KF, Y766A and Y766F 
indicate that the interconversion rates between open and closed 
states depend on residue 766. To characterize the equilibria, we 
used probability distribution analysis (PDA)^'^^'^^ and burst 
variance analysis (BVA)^'^^, both described in Methods. PDA 



revealed that the FRET distribution fits well either to a 'static' 
model of two independent molecular entities or a 'dynamic' 
model ^^'^^ that invokes dynamic interconversions of a single 
molecular entity between two distinct FRET states (Fig. 5a, top). 

We used BVA to show that the dynamic model prediction 
fitted the data significantly better than the static model (Fig. 5a), 
suggesting the presence of dynamic heterogeneity for all three KF 
derivatives. To further analyse the conformational dynamics, we 
performed PDA using the FRET efficiencies E\ obtained from the 
initial PDA fit (dynamic model), and plotted the /^-values as a 
function of ki and k_i (Fig. 5a, bottom). For wt KF, the 
minimization surface forms a distinct well at /ci = 160s~^ and 
/c _ 1 = 320 s ~ ^ {x^< 2), consistent with our previous reports^'^'^^. 
In contrast, both Y766A and Y766F display different landscapes 
(Figs. 5b,c), whereas Y766A shows a distinct well at A;i = 15 s~ ^ 
and A; _ 1 = 120 s ~ ^ Y766F displays a broad range of possible rate 
combinations with a shallow minimum at ki ~ 450 s ~ ^ and 
A;_i-120s-^ 



Accessing the equilibrium dynamics of wt KF complexes. To 

characterize the timescales of interconversions within the sequential 
four-state model (Fig. 3c), we used PDA and a novel analysis of 
FRET distribution widths. Using PDA, we characterized the inter- 
conversions between O and C for the wt KF binary complex, and 



wt KF 



Y766A 



Y766F 



Data — Stat, x^ = ^.5 — dyn. = 2.0 



Data; —Stat. %2 = 1.7 — dyn. %2= 1.6; 



Data; — stat. x^ = ^.7 — dyn. %2 = 1 .5 



0.25 
0.2 




800 



600 



400 



200 



0.2 0.4 0.6 0.8 

FRET efficiency P 



0.2 0.4 0.6 0.8 

FRET efficiency E* 



0.2 0.4 0.6 0.8 

FRET efficiency E* 




800 



600 



400 



200 




800 



600 



400 



200 




1- 

I 



10 

8 

6 

4 

2 



0 200 400 600 800 
Closed to open rate /c_-|(s~^) 



0 200 400 600 800 
Closed to open rate /c_-|(s~^) 



0 200 400 600 800 
Closed to open rate k_^{s~^) 



Figure 5 | Conformational dynamics of unliganded wt KF and Y766 derivatives, (a) Analysis of wt KF using PDA (upper panel), BVA (middle panel) and 
a rate-dependent PDA ;(^-nninimization surface (lower panel). The P histogrann was fitted using PDA, assuming either a static (black line) or a dynamic 
(red line) two-species model, wherein the polymerase occupies either the closed or open state yielding /ci = 150s~ \ and /c_t = 320s~^ for the dynamic 
model (see Supplementary Methods for fitting parameters). Whereas PDA achieved satisfactory fits to the data in either scenario, as shown by the 
residuals plot (x^<2), the BVA data (grey line) favoured the dynamic model (red line). The PDA ^(^-minimization surface uses the mean FRET positions and 
excess widths from the dynamic fit, plotting the /^<10 values as a function of the rates. Best fits are obtained for k-\^^60s~^ and /c_t~320s~1 
(b,c) Analysis of conformational dynamics for Y766A and Y766F, as in a, yielding interconversion rates of /ct = 9s~\ /c_t = 128s~^ for Y766A and 
/ct = 435s~ \ and Ic_t = 108s~^ for Y766F. For fitting parameters, see Supplementary Methods; for PDA of the wt KF binary complex and the wt KF ternary 
complex with an incorrect nucleotide, see Supplementary Fig. S4. 



6 



NATURE COMMUNICATIONS | 4:2131 | DO!: 10.1038/ncomms3131 | www.nature.com/naturecommunications 
© 2013 Macmillan Publishers Limited. All rights reserved. 



NATURE COMMUNICATIONS | DPI: 10.1038/ncomms3131 



ARTICLE 



between IN and CN for the wt KF ternary complex with an A-dGTP 
mispair at saturating nucleotide conditions (Supplementary Fig. S4); 
this analysis yielded rate constants of A;o^c=15s~^ and 
/cc^o = 85 s ~ ^ for the binary complex and A;in^cn = 3 s ~ ^ and 
^CN^iN = 45 s ~ ^ for the mispaired ternary complex. 

Accessing the remaining rates (corresponding to rate constants 
/co^iN and /cin^o) is challenging, as the analysis of the low- 
FRET species is hampered by our inability to resolve the FRET 
distributions of the O and IN states. Width analysis (see 
Methods) of our data (Supplementary Fig. S5c) revealed broad- 
ening of the low-FRET species around K^i for several nucleotide 
titrations (for wt KF: A-dGTP and A-rUTP; for E710A: A-dTTP, 
A-dGTP and A-rUTP), indicating that the interconversions are at 
the millisecond timescale or slower. For the E710A A-rUTP 
titrations, for example, there is a maximum broadening of ~ 6% 
at 100 \xM rUTP, close to the corresponding K^i of 92 |iM. For a 
better estimate of the timescale, we simulated the four- state 
model using four fixed rates (by fixing /cq^cj ^c^o> ^in^cn 
and /ccn^in) and varying /cq^ in and /cin^o (Supplementary 
Fig. S5d; as in Santoso et al7). The width of the low-FRET species 
at a nucleotide concentration equal to K^i remains constant for 
high IN-to-O rates (>300s~^), whereas for rates of <300s~^ 
there is a significant width increase around Kji. We thus propose 
a rate of 300 s ~ ^ as an upper limit for the IN-to-O transition. 

Discussion 

Previously, we described a FRET state between the open and 
closed states for wt KF ternary complexes with incorrect 
nucleotides^. Data presented here strongly suggest that wt KF 
accesses this intermediate state even with complementary dNTPs, 
consistent with this state being on the path to nucleotide 
incorporation. Whereas the intermediate-FRET state is sparsely 
populated in the ternary complex of wt KF with a complementary 
dNTP, the three mutators (Y766A, E710A and E710Q) behave 
differently; their ternary complexes with complementary dNTPs 
occupy mainly the intermediate-FRET state, with Y766A and the 
G-dCTP base pair being the only exception. 

Initially, we attempted to relate the intermediate-FRET species 
to available cocrystal structures, speculating that the ternary 
complex with a complementary rNTP undergoes partial fingers 
closure, whereas the mispaired ternary complex resembles a 
fingers-open conformation^. Since then, a better structural 
candidate for the intermediate-FRET species has been 
described^^ Using Bacillus stearothermophilus DNA polymerase 
(a KF homologue), Wu and Beese^^ captured a partially closed 
('ajar') fingers conformation (Fig. 6a) in a ternary complex with a 
G-dTTP mismatch, using both a mutator (Y714S, equivalent to 
Y766S in KF) and a non-mutator (F710Y, equivalent to F762Y in 
KF) polymerase; the partially closed state corresponds to fingers- 
closing along the open-to-closed transition, leading to a distance 
change of ~ 2 A between the sites carrying the FRET probes, in 
excellent agreement with our observations. 

Evidence for a partially closed state was also reported in a 
recent FRET study on immobilized KF complexes^^. In contrast 
to our assay, which reports directly on conformational states by 
monitoring an intra-protein distance, Berezhna et al?'^ used 
donor-labelled DNA and acceptor-labelled polymerases to infer a 
protein conformational change by measuring a DNA-protein 
distance; in the latter assay, any protein conformational changes 
may be superimposed on concurrent DNA conformational 
changes, as well as on possible polymerase repositioning relative 
to the DNA. 

The partially closed cocrystal structures may also relate to other 
intermediate-FRET species we have observed. By introducing a 
mutation that prevents full closing of the O -helix, Wu and Beese^^ 



captured a partially closed B. stearothermophilus ternary complex 
with a correctly paired dNTP, a candidate for the sparsely 
populated species of intermediate-FRET value in the ternary 
complexes of wt KF with correct dNTPs. More speculatively, we 
note that the Pol- dNTP binary complex (without DNA) has a 
similar intermediate-FRET signature (Supplementary Fig. S2e; see 
also Santoso et al7), suggesting that O-helix movement might be 
initiated by dNTP binding in the KF cleft without direct DNA 
involvement. Our iQ measurements (Table 1) indicate that 
nucleotide binding is destabilized by the presence of DNA with 
a non- complementary template base, just as DNA binding is 
destabilized by a mismatched dNTP (O.B., N.D.F.G., C.M.J, 
unpublished)^ ; both cases promote dissociation of mispaired 
ternary complexes. 

It seems reasonable to assume that the partially closed complex 
is on the pathway to the chemical step, given that the partially 
closed state is structurally part way between open and closed 
states. We cannot directly rule out the alternative scenario 
whereby the partially closed conformation is an 'off-pathway' 
state in equilibrium with the open but not the closed state as, in 
either case, the equilibria follow similar relations (Supplementary 
Methods). 

The partially closed conformation has the hallmarks of an early 
checkpoint, being populated in all wt KF ternary complex species, 
although these species progress differently along the reaction 
pathway. This conformation must be formed at an early step, 
most probably immediately after the Pol- DNA binary complex 
binds a nucleotide and its Mg^+ ion (step 2, Fig. 6b). Although 
our FRET pair does not distinguish between mispaired and 
rNTP -containing ternary complexes, experiments using 2-ami- 
nopurine on DNA substrates show that wt KF complexes with a 
complementary rNTP proceed further than mispairs (O.B., 
N.D.F.G., C.M.J, unpublished)^^. A nanopore-based study of 
single KF-DNA complexes also supports this conclusion^"*. The 
partially closed conformation, therefore, encompasses at least two 
related ternary complexes (Fig. 6b) whose equilibria determine 
the progression of a particular ternary complex towards product 
formation. First, the partially closed ternary complex undergoes a 
base-checking step (step 2.1) in which the equilibrium is 
unfavourable for all non-complementary nucleotides (dNTPs 
and rNTPs), thus rejecting six out of the eight nucleoside 
triphosphates. Next, the sugar structure is tested against the 
active-site pocket (step 2.2), rejecting the bulkier rNTPs and 
allowing complexes containing complementary dNTPs to proceed 
to the closed conformation, from which they undergo subsequent 
steps, including D705 reorientation and binding of the second 
Mg^+ ion^^. 

The calculated equilibrium constants (Table 1) enabled us to 
propose a free- energy landscape for the prechemistry nucleotide 
selection steps (Fig. 6c). Although a simplified landscape has 
been proposed before (on the basis of fluorescence enhancement 
of a dye-labelled DNA upon T7 DNA polymerase binding^^), 
our ability to monitor the conformational coordinate directly 
allowed us to add the important partially closed conformation 
checkpoint. 

The dissociation constant for a nucleotide in the partially 
closed ternary complex, K^i, reports on interactions in the wt KF 
active site. The binding of the complementary dNTP is favoured, 
leading to a stabilized partially closed state ( — 8 kj mol ~ * for 
A-dTTP versus — 0.7 kJ mol ~ * for A-dGTP, using the nucleotide 
concentrations in E. coli^^); this may result from tighter binding 
within either or both of Epc»DNA»r/dNTP-Mg^ + and 
Epc^DNA'^^r/dNTP-Mg^ + (see Fig. 6b). Moreover, the similar 
K^i values for A-dGTP and A-rGTP complexes (Table 1) indicate 
that the ribose 2'-OH is not detected in the initial Epc»DNA»r/ 
dNTP-Mg2+ complex. 
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Figure 6 | Roles of the partially closed state in nucleotide selection, (a) Proposed structure of the polymerase confornnation corresponding to the 
intermediate-FRET state. The crystal structure of the Pol-DNA binary complex (PDB accession code 1L3U, mobile section green)^ is superimposed on the 
crystal structures of a ternary complex where the fingers adopt a partially closed ('ajar') conformation (PDB accession code 3HP0, mobile section yellow)^^ 
and a ternary complex where the fingers adopt the closed conformation (PDB accession code 1LV5, mobile section orange)^. The distances between the Cp 
positions of residue 550 (red sphere on the thumb) and 744 (sphere on the fingers) are 5.2 nm in the open, 5.0 nm in the partially closed and 4.2 nm in the 
closed conformation. The predicted distance change of ~2 A on going from the open to the partially closed conformation is consistent with the observed 
shift of AP~0.04, as we operate in the linear part of the FRET dynamic range, (b) Proposed reaction pathway for KF from DNA binding to phosphoryl 
transfer, based on references^'^^'^-^ and our current results. Eq and Ec represent the open and the closed conformations, respectively, whereas Epc 
represents the intermediate-FRET state interpreted as a partially closed conformation, (c) Proposed free-energy landscapes for the prechemistry nucleotide 
selection steps prepared using the equilibrium constants in Table 1 and in vivo dNTP concentrations^^ (green curve: complementary nucleotide A-dTTP; red 
curve: non-complementary nucleotide A-dGTP; see also Methods). Reaction coordinate: degree of fingers-closing (shown to scale). The free energy of the 
open binary state is set to zero. Binding of a nucleotide stabilizes the complex with the partially closed conformation, either weakly (incorrect base pair 
A-dGTP) or strongly (correct base pair A-dTTP). Equilibration to the fully closed conformation is favoured only for the correct nucleotide. Although the 
exact activation energy barriers (dotted lines) between the conformations are unknown, we included the rates for the transition of the A-dGTP ternary 
complex between the open and partially closed states (estimated using FRET-width analysis; Supplementary Fig. 55) and the rates for the transitions 
between the partially closed and the closed states for correct and incorrect nucleotides (determined using PDA; Supplementary Figs Sic and S4, 
respectively). 



X2, describing the equilibrium between partially closed and 
closed complexes, defines the free energy of the closed ternary 
complex. When K2 is > 1, as for correctly paired ternary 
complexes of wt KF, the closed complex is the lowest- energy 
species ( — 4kJmol~^ relative to the A-dTTP partially closed 
state) on the prechemistry pathway, and its enhanced stability 
relative to the partially closed ternary complex (Kd2<^di) drives 
the forward reaction. With an inappropriate substrate, such as a 
mispair, K2 is < 1 so that the closed complex is destabilized 
relative to the partially closed ternary complex ( + 6 kj mol ~ ^ for 
A-dGTP) and the equilibrium favours species formed early in the 
pathway. The extremely slow rates for misinsertion (>10^-fold 
slower than complementary dNTP additions) probably result 
from low occupancy of the closed conformation combined with 
active- site distortions 

8 



The fidelity mutants in this study show a reduced ability to 
recognize complementary dNTPs as 'correct' (Table 1), having 
lower affinity for the initial binding of the complementary dNTP to 
the partially closed complex and an unfavourable K2. Therefore, the 
partially closed conformation is highly populated in the resulting 
ternary complexes so that, in the absence of the E710 or Y766 side 
chains, the complex with the complementary dNTP resembles a 
mispaired complex. Our data show a continuum of behaviours, 
with both the templating base and the particular mutator allele 
influencing how far the conformational equilibria shift towards the 
closed state; at one extreme is the E710A A-dTTP complex, which 
has little of the closed conformation {K2 = 0.25), at the other is the 
Y766A G-dCTP complex, which resembles wt KF {K2 = 8.9); in 
between are several complexes with ^2^1, having substantial 
amounts of both closed and partially closed complexes. 
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Why should alteration of E710 or Y766 destabilize the closed 
ternary complex? Structurally, E710 and Y766 define much of the 
surface of the active site-binding pocket so that they interact more 
closely with the nascent base pair in the closed complex than in 
the partially closed complex, where the base pair is more solvent 
exposed. Surprisingly, the hydrogen bond between E710 and 
Y766, despite being invariant in A-family DNA polymerases, has 
a minor role in the closed complex stability, as shown by the 
similar FRET profile for complexes of Y766F and wt KF. More 
important may be the overall size and shape of the 766 side chain 
and its ability to interact with and position the nearby F762 side 
chain, which forms part of the binding pocket. 

Structural studies argue that the transition from partially 
closed to fully closed conformation is required for catalysis of the 
polymerase reaction. In the partially closed conformation, the 
primer 3' -OH is too far from the a-phosphate of the bound 
nucleotide; moreover, the rest of the active site, including the 
catalytic metal ions, is not properly assembled for a chemical 
attack^ ^ Our data support this reasoning, showing a rough 
correlation between the rate of nucleotide incorporation 
(/Cpoi, Supplementary Table S2) and closed- complex occupancy; 
the most notable example is G-dCTP incorporation by Y766A, 
where high occupancy of the closed complex correlates with 
near-wild-type kinetics. Moreover, for the mutator proteins, the 
single- turnover for A-dTTP incorporation (i^d(dNTP)> 

Supplementary Table S2) approximates to (Table 1), 

consistent with closed- complex formation being required for 
the chemical step. (This assumes that the chemical step is rate 
limiting for these mutant proteins. For wt KF (and probably 
Y766F), the same relationship does not hold, as prechemistry 
step 3 is rate limiting)^^. Nevertheless, as the low incorporation 
rate for E710Q shows (Supplementary Table S2), achieving the 
closed state with moderate efficiency is, of itself, no guarantee of 
effective catalysis if the mutation distorts the final active -site 
geometry. 

Our data support the generalization that alterations in 
polymerase fidelity correlate with the kinetics of handling correct, 
rather than incorrect, nucleotides^^. Compared with wt KF, the 
mutator variants show at most a threefold enhancement in the 
binding (Kai) of the non- complementary A-dGTP and no 
improvement in K2. This contrasts with the much larger deficits 
in both constants for the mutators when binding the correct 
A-dTTP. As the closed complex selects for the correct template- 
dNTP pairing, any change to the energetics of this complex will 
exclusively affect correct dNTP incorporation. Moreover, even in 
the partially closed conformation, changes in active-site side 
chains diminish specificity for the correct pairing more than they 
enhance the interaction with mispairs. 

The effects of template base and mutator allele on closed- 
complex energetics provide a partial explanation of the 
observed error specificities of the KF mutators (Supplementary 
Fig. S6)^^~^'*. Closed complexes with the correct dNTP are 
least populated when E710A encounters template A or when 
Y766A encounters template A or T, making these the situations 
where recognition of the correct base pair is least effective. 
One would therefore expect E710A to be less accurate at template 
A than that at G or T, and Y766A to be less accurate at templates 
A and T, relative to G. This is consistent with the high frequency 
of A-dCTP errors made by E710A and T-dGTP made by 
Y766A but does not explain the lower frequency of other errors 
at these particular template bases. The substantial frequency 
of Y766A G-dTTP errors is noteworthy, given that Y766A 
handles the G-dCTP pair with almost wild-type efficiency; 
this could be a case where a mutator actually facilitates the 
interaction with the mispair, consistent with earlier kinetic 
data^"*. As the experimentally observed polymerase errors in 



Supplementary Fig. S6 require not only covalent addition of 
mispaired bases but also extension of the resulting mismatched 
primer termini, they will be influenced by steric or geometric 
constraints in intermediates for both reactions so that the 
overall error specificity reflects the combination of several 
factors. 

Finally, our results on the unliganded Y766 polymerase 
derivatives reinforce our previous observations that the con- 
formational equilibrium of the unliganded polymerase is delicate; 
thus, the removal of even a single oxygen atom (as in Y766F) 
increased substantially the occupancy of the closed 
state for the unliganded polymerase, whereas the more drastic 
Y766A mutation had the opposite effect. By contrast, changes 
to the 710 side chain did not influence the conformational 
equilibrium, suggesting that the behaviour of Y766 derivatives 
is not directly related to the missing 710-766 hydrogen bond 
in the closed conformation. An alternative explanation considers, 
instead, the open conformation, where the 766 side chain is 
extremely solvent exposed; one might expect the side chains 
to favour the open conformation in the reverse order of 
their hydrophobicity: A>Y>F, as is observed. Unlike the 
unliganded polymerase, the FRET distributions of binary 
complexes of the mutant and wt polymerases were similar, 
indicating that the conformational landscape is dominated by 
Pol-DNA interactions, and that the apparent conformational 
equilibrium of unliganded polymerases is not a reliable predictor 
of the conformational landscape of the corresponding binary 
complexes. 

Methods 

Single-molecule FRET measurements. Single-molecule FRET experiments 
were performed at room temperature ( » 22 °C) using a home-built confocal 
microscope equipped with 20 kHz alternating laser excitation between a 532-nm 
(operated at 240 |aW) and a 638-nm laser (operated at 60 |iW), coupled to a x 60, 
1.35 NA, UPLSAPO 60XO (Olympus) objective. The alternating laser excitation 
allows filtering for correctly labelled species bearing an active donor and an 
active acceptor^^. After filtering each fluorescent burst for the correct labelling 
stoichiometry, we calculated the apparent FRET efficiency E'^ for each burst as 
£'*^= DA/(DD + DA), where DA is the number of photons in the red detection 
channel after green excitation and DD the number of photons in green detection 
channel after green excitation. Doubly labelled KF derivatives were present at 
< 100 pM in single-molecule observation buffer (40 mM 4-(2-hydroxyethyl)-l- 
piperazineethanesulfonic acid (HEPES)-NaOH, pH 7.3, 10 mM MgCl2, 1 mM 
dithiothreitol, 100|igml~^ bovine serum albumin, 5% (vol/vol) glycerol, 1 mM 
mercaptoethylamine) . 

Analysis of FRET histograms. Single-molecule FRET experiments are sensitive to 
subtle changes in conditions such as room temperature and set-up alignment^^'^^ 
To minimize changes, every set of experiments contains reference measurements of 
a DNA FRET standard with a reference FRET value of £^~0.5 (ref. 32), the 
unliganded polymerase, the binary complex formed with DNA and the ternary 
complex with DNA and nucleotides. Each titration experiment was completed in a 
single day. 

The stoichiometry- filtered E"^ FRET histograms were fitted to double- Gaussian 
functions that were constrained by a maximum width of a^ax (E"^) ^ 0.07. The 
double Gaussians report on peak positions and relative occupancies. For titration 
experiments, we plotted the relative peak shift (^'^obs — £%en)/(£ '^closed — E^p^n) of 
the low-FRET Gaussian and the fractional occupancy of the high-FRET Gaussian 
as a function of nucleotide concentration. We fixed the position of the high-FRET 
Gaussians to the value obtained from a free fit of the binary complex or a saturated 
ternary complex with complementary dNTPs. We did not fix the position of the 
low-FRET Gaussian. 

Derivation of the multistate equilibrium model. To determine the parameters 
governing the equilibria between the species observed in the single-molecule 
FRET histograms (Fig. 3c), we derived equations for the two concentration- 
dependent FRET observables: the peak position of the low-FRET Gaussian and the 
fraction of the population in the high-FRET Gaussian. The binary complex is in 
dynamic equilibrium between open (O, low-FRET) and closed (C, high-FRET) 
conformations, described by the equilibrium constant Ko= [C]/[0]. Binding of a 
nucleotide (N) to the open conformation, forming the intermediate- FRET ternary 
complex (IN), is described by the association constant Ki = [IN]/[0][N]. The 
intermediate- FRET ternary complex can undergo an additional conformational 
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transition to the closed ternary complex (CN, high-FRET), governed by the 
equilibrium constant K2 = [CN]/[IN]. The corresponding dissociation constants 
are defined as Kai = 1/Ki for the formation of the intermediate- FRET state and 
= l/{KiK2) for the formation of the high-FRET (fully closed) state. The total 
polymerase concentration is 

[Total] = [0] + [C] + [IN] + [CN] 

= [O] + Ko [O] + Ki [O] [N] + K,K2 [O] [N] . 

The fractions of each species as a function of nucleotide concentration are: 

[O] / [Total] = 1 / ( 1 + Ko + Ki [N] + Ki K2 [N] ) , (2) 

[C]/[Total] =Ko/(l +Ko +Ki[N] +KiK2[N]), (3) 

[IN] / [Total] = Ki [N] / ( 1 + Ko + Ki [N] + Ki K2 [N] ) , and (4) 

[CN]/[Total] =KiK2[N]/(l +Ko +Ki[N] +KiK2[N]). (5) 

The two FRET observables are given (a) by the normalized peak shift of the 
low-FRET Gaussian 

(^:bs - ^:pen)/(£c*losed ' KpJ = 7max [IN] / ( [O] + [IN]) 

= 7maxKi[N]/(l+Ki[N]) (6) 

[N]/(Kdi + [N]), 

where )/niax corresponds to the normalized peak shift at saturating nucleotide 
concentration and (b) by the fraction of high-FRET species 

([C] + [CN])/[Total] = (Ko +KiK2[N])/(l +Ko +Ki[N] +KiK2[N]). (7) 

We used a global fitting approach to fit both FRET observables simultaneously as a 
function of nucleotide concentration, and to determine the equilibrium constants 
Kq, Ki and K2, and the parameter j/max- From the equilibrium constants and their 
s.e., we calculated the dissociation constants. Identical equations can be derived to 
describe nucleotide binding to the polymerase in the absence of DNA; in this case, 
O and C refer to the unliganded protein. 

Probability distribution analysis and burst variance analysis. For PDA^, the 
static model is fully described by two peak positions E% two excess widths 
(accounting for additional broadening of <E'^i> due to acceptor 
photophysics^^'^^) and two relative occupancies N^. In comparison, the dynamic 
model replaces the relative occupancies with two rate constants: ki for the open-to- 
closed transition and k^i for the closed-to-open transition. We optimized 
parameters for both models using a least mean square algorithm implemented in 
MATLAB. BVA directly evaluates FRET distributions for the presence of dynamic 
heterogeneity^^. In BVA, the experimentally measured s.d. of the FRET efficiency 
for individual transits through the confocal spot is compared with the s.d. predicted 
by the static and dynamic models. 

Width analysis for obtaining rate constants. At nucleotide concentrations equal 
to the dissociation constant K^i (where [O] equals [IN]), the interconversion rates 
between O and IN, /cq^ in and ki^^o, are equal. These rates also control the width 
of the low-FRET species distribution: if ko^m and /cin^o are high and lead to 
multiple interconversions during the transit of the complex through the detection 
volume ( ~ 3 ms (ref. 7)), the FRET averaging between O and IN will yield a FRET 
width for the low-FRET species that matches that for the binary or partially closed 
ternary complex for all nucleotide concentrations. In contrast, slower 
interconversions will lead to a FRET-width increase; this increase should peak 
around Kji (Supplementary Fig. S5a,b). 

Calculating the free-energy landscape of ternary complexes. To construct the 
free-energy landscape for the prechemistry conformations of wild-type DNA 
polymerase in the presence of nucleotides, we calculated the relative Gibbs free 
energies A^G for the partially closed conformation and the fully closed con- 
formation in ternary complexes by using the open conformation in the binary 
complex as a reference. To estimate zlrG(o/iN)> the free-energy difference between 
the open conformation and the partially closed conformation, for a complementary 
dNTP (A-dTTP), we used: A^G = A^G^ + RT \n{Q,). The first term, A^G^, is the free 
energy corresponding an O-to-IN equilibrium with 1 M concentrations for all 
species involved and is calculated using ArG^ = — RT In(Ki), where R is the ideal 
gas constant, T the temperature and K^ is the equilibrium association constant 
measured using our nucleotide titration experiments (Table 1). The second term 
requires the reaction quotient Qr, which is a function of the actual concentrations 
of all species involved in the equilibrium: Qr = {[IN]/[0][N]}. To estimate Qr, we 
set [N] to the E.coli intracellular concentration for dTTP (120 |iM)^^, and assumed 
that [IN] = [O] . To estimate Aj-G(in/cn)> the free-energy difference between the 
partially closed conformation and the closed conformation with bound nucleotide, 
we used ArG(iN/cN) = — RT ln(K2). Similar expressions were used for the free 
energies of the ternary complexes formed upon binding to an incorrect dNTP 
(A-dGTP); for dGTP, we used [N] = 60 |iM^^. Using the proposed rate of 
~ 300 s ~ ^ as an upper limit for the rate of the IN-to-O transition for A-dGTP, we 



calculate the rate of the O-to-IN transition at 60|iM dGTP in E.coli as follows: 
rate = Ki • V-o • 60 |iM = 21.7»103 M'^ • 300s-^ • 60»10-^M = 390s-^ 

The following procedures are further described in the Supplementary 
Methods: DNA preparation, protein expression and purification, and site-specific 
labelling of KF. 
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